Semantic-Preserved Communication System for Highly Efficient Speech Transmission

Abstract

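Deep learning (DL) based semantic communication methods have been explored for the efficient transmission of images, text, and speech in recent years. In contrast to traditional wireless communication methods that focus on the transmission of abstract symbols, semantic communication approaches attempt to achieve better transmission efficiency by only sending the semantic-related information of the source data. In this paper, we consider semantic-oriented speech transmission, which transmits only the semantic-relevant information over the channel for the speech recognition task, and a compact additional set of semantic-irrelevant information for the speech reconstruction task. We propose a novel end-to-end DL-based transceiver which extracts and encodes the semantic information from the input speech spectrums at the transmitter and outputs the corresponding transcriptions from the decoded semantic information at the receiver. In particular, we employ a soft alignment module and a redundancy removal module to extract only the text-related semantic features while dropping semantically redundant content, greatly reducing the amount of semantic redundancy compared to existing methods. We also propose a semantic correction module to further correct the predicted transcription with semantic knowledge by leveraging a pretrained language model. For speech-to-speech transmission, we further include a CTC alignment module that extracts a small number of additional semantic-irrelevant but speech-related information, such as duration, pitch, power, and speaker identification of the speech, for better reconstruction of the original speech signals at the receiver. We also introduce a two-stage training scheme which speeds up the training of the proposed DL model. The simulation results confirm that our proposed method outperforms current methods in terms of the accuracy of the predicted text for speech-to-text transmission and the quality of the recovered speech signals for speech-to-speech transmission, and significantly improves transmission efficiency. More specifically, the proposed method sends only 16% of the transmitted symbols required by existing methods while achieving about a 10% reduction in WER for speech-to-text transmission. For speech-to-speech transmission, it achieves an even more remarkable improvement in transmission efficiency, sending only 0.2% of the transmitted symbols required by the existing method while preserving comparable quality of the reconstructed speech signals.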
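
The abstract reports its speech-to-text results in terms of WER (word error rate). As a reminder of how that metric is commonly computed, the following is a minimal sketch, assuming whitespace tokenization and the standard definition of WER as the word-level edit distance (substitutions, deletions, insertions) divided by the number of reference words. The function name `word_error_rate` and the sample sentences are illustrative and do not come from the paper, whose evaluation pipeline may normalize text differently.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length.

    Illustrative helper (not from the paper); assumes whitespace tokenization.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix processed
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,                      # deletion of a reference word
                curr[j - 1] + 1,                  # insertion of a hypothesis word
                prev[j - 1] + substitution_cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # One substitution ("sat" -> "sit") and one deletion ("the"): 2 errors / 6 words.
    print(word_error_rate("the cat sat on the mat", "the cat sit on mat"))
```

Because WER is normalized by the reference length, it can exceed 1.0 when the hypothesis contains many insertions. A lower value indicates a more accurate transcription.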

Similar resources

Semantic typology and efficient communication

Cross-linguistic work on domains including kinship, color, folk biology, number, and spatial relations has documented the different ways in which languages carve up the world into named categories. Although word meanings vary widely across languages, unrelated languages often have words with similar or identical meanings, and many logically possible meanings are never observed. We review work s...

Portable light transmission measuring system for preserved corneas

BACKGROUND: The authors have developed a small portable device for the objective measurement of the transparency of corneas stored in preservative medium, for use by eye banks in evaluation prior to transplantation. METHODS: The optical system consists of a white light, lenses, and pinholes that collimate the white light beams and illuminate the cornea in its preservative medium, and an optical...

Highly Efficient Prion Transmission by Blood Transfusion

It is now clearly established that the transfusion of blood from variant CJD (v-CJD) infected individuals can transmit the disease. Since the number of asymptomatic infected donors remains unresolved, inter-individual v-CJD transmission through blood and blood derived products is a major public health concern. Current risk assessments for transmission of v-CJD by blood and blood derived product...

Rich Semantic Models and Knowledgebases for Highly-Structured Scientific Communication

Rather than using text for scientific research reports, we have proposed developing highly-structured reports with rich semantic models. In this paper, we consider detailed structures for the components of research reports using a modeling framework based on a rigorous upper ontology. For instance, we consider the use of structured descriptions of Research Designs to support evaluation of inter...

System Description: A Highly Interactive Speech-to-Speech Translation System

Spoken Translation, Inc. (STI) of Berkeley, CA has developed a commercial system for interactive speech-to-speech machine translation designed for both high accuracy and broad linguistic and topical coverage. Planned use is in situations requiring both of these features, for example in helping Spanish-speaking patients to communicate with English-speaking doctors, nurses, and other health-care s...

Journal

Journal title: IEEE Journal on Selected Areas in Communications

Year: 2023

ISSN: 0733-8716, 1558-0008

DOI: https://doi.org/10.1109/jsac.2022.3221952